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Abstract 

Let A be a finite set and n > 2. This paper introduces the concept of universal simulation 
in the context of semigroups of transformations of A", also known as finite state-homogeneous 
automata networks. For m > n, a transformation of A™ is defined as n-universal of size m if it 
may simulate every transformation of A" by updating one coordinate (or register) at a time. Using 
tools from memoryless computation, it is established that there is no n-universal transformation of 
size n, but there is such a transformation of size n -I- 2. An n-universal transformation is defined as 
complete if it may sequentially simulate every finite sequence of transformations of A"; in this case, 
minimal examples and bounds for the size and time of simulation are determined. It is also shown 
that there is no n-universal transformation that updates all the registers in parallel, but that there 
exists a complete one that updates all but one register in parallel. This illustrates the strengths 
and weaknesses of parallel models of computation, such as cellular automata. 


1 Introduction 

Memoryless computation (MC) is a modern paradigm for computing any transformation of A”’, with 
A a finite set and n > 2, by updating one coordinate at a time while using no memory. Its basic 
idea was developed in laiasiEiEiEiE], and expanded in laiinitts]. The seminal example of MC is 
the famous XOR swap algorithm, which is analogous to the butterfly network, the canonical example 
of network coding (see HD- In the following paragraphs, we shall introduce notation and review the 
main definitions of MC. 

Let q be the cardinality of A. Without loss, we usually regard A as the ring Zg = 'LjqL or, when 
q is a prime power, the field GF(g). Since the cases when g = 1 or n = 1 are trivial, we shall assume 
q > 2 and n > 2 henceforth. We refer the coordinates of A” as registers and the elements of A” 
as states. Denote by G A”’ the state with 1 at its A;-th register and zero everywhere else, and by 
gO g state with zeros in all its registers. For any a € A"", we denote by Oj the image of a under 

the f-th coordinate projection. 

We are interested in studying transformations of A”, i.e., functions of the form A"' —>■ A"'. Denote 
by Tran(A”) the set of all transformations of A"', and by Sing(A”’) and Sym(A"') the set of all singular 
and nonsingular transformations of A”, respectively. The sets Tran(A"') and Sing(A”), equipped with 
the composition of transformations o, form semigroups called the full transformation semigroup on 
A"' and the singular semigroup on A”, respectively. The set Sym(A"'), equipped with o, forms a group 
called the symmetric group on A”. 

In general, if V is a subset of a semigroup S', let {Y) be the smallest subsemigroup of S containing 
Y. Say that V is a generating set of S when S = (V). In particular, if S is a subsemigroup of 
Tran(A”), and V is a generating set of S, the triple (A”, S, Y) is referred as a finite state-homogeneous 
automata network (see HD p. 200]). 

Throughout this paper, we adopt the convention of applying functions on the right; then {x)f 
denotes the image of x G A*^ under / G Tran(A”'), and fog (or simply fg) denotes the composition 
of functions (x)(/ o g) = ((x)f)g. The size of the image of a transformation is referred as its rank. 
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We view each transformation of as a tuple of functions / = (/i,..., fn), where fi : A 

is referred to as the i-th coordinate function of /. In particular, an i-th coordinate function is trivial 
if it is equal to the i-th projection; {x)fi = Xi, for all x € A^. When considering a sequence of 
transformations, we shall use superscripts in brackets, e.g. ■ ■ ■, G Tran(^"'). 

The following is the key definition of memoryless computation. 

Definition 1 (Instruction). An instruction of A” is a transformation / : A"' —)• A^ with at most one 
nontrivial coordinate function. A permutation instruction is an instruction which maps A" bijectively 
onto A"". 

The previous definition implies that the identity transformation of A'^ is an instruction. We denote 
the set of instructions of A" as X(A"'), and the set of permutation instructions as X(A"). We shall 
simply write X and X when there is no ambiguity. Note that any nontrivial instruction / G X is 
uniquely determined by its nontrivial coordinate function fi] hence, in this case, we say that / updates 
the Tth register, and we shall often denote / by its update form: 

f :xi^ {x)fi. 

For instance, if A = GF(2) and n = 2, then X is given by 

{xi^xi, xi ^ xi + I, xi^xi+X2, xi^a:i+X 2 + l, 

X2-<^X2, X2 ^ X2 + 1, X2-^Xi+X2, X2 •(- + X2 + 1}, 

where the identity may be represented by either xi ■<— xi or X 2 X 2 - 

One of the most important features of the instruction sets X and X is that they are generating sets 

of Tran(A"') and Sym(A"), respectively (see [2l fT^i. 

Definition 2 (Program). For any g G Tran(A”), a program of length i computing 5 ; is a sequence of 
instructions f^^\ ■ ■ ■, f^^'^ G X such that 

g = /(b o ... o /W. 

Unless specified otherwise, we assume that every instruction in a program is different from the 
identity. Moreover, since the set of instructions updating a given register is closed under composition, 
we may always assume that updates a different register than for all k. In this paper, we 

shall work with particular subsets of instructions Y X X. Hence, for any transformation g G (T), 
we define the memoryless complexity of g with respect to Y as the minimum length of a program 
computing g with instructions from Y. The memoryless complexity of g with respect to X is simply 
called the memoryless complexity of g. 

Example 1. In order to illustrate our notations, let us write the program computing the swap of two 
variables, i.e. 5 ; —>■ where {xi,X 2 )g = {x 2 ,xi). It is given as follows: 

9 = /<‘)„/P)o/P). 

where 


Y: 

Xl 


Xl + X2 

Y: 

X2 

<r- 

H 

1 

to 

Y: 

Xl 


Xl — X2 


or, equivalently 

{xi,X 2 )f^^^ = (a:i + X 2 ,X 2 ), 
(xi,X2)f^^^ = (xi,xi - X2), 
(xi,X2)f^^^ = (xi - X2,X2). 
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This paper is organised as follows. In Section [21 we introduce our notion of simulation, which 
is a way of computing a transformation of using m > n instructions that may depend on m — 
n extra registers. We say that a transformation of is n-universal if the instructions induced 
by its coordinate functions may simulate any transformation of A^. We show that there is no n- 
universal transformation that uses no extra registers, but that there is one that uses only two extra 
registers. Then, we construct an n-universal transformation with maximal time of simulation g"'-|-0(n), 
and conjecture that is the lower bound for the maximal time of simulation of any n-universal 
transformation. 

In Section [3l we introduce the notion of sequential simulation. We say that an n-universal transfor¬ 
mation is complete if it may sequentially simulate any sequence of transformations of . We establish 
that any such transformation requires at least n extra registers, and we construct one with n + 2 extra 
registers when q > 3, and n + 3 extra registers when q = 2. Then, we establish lower bounds for 
the maximal and minimal time of simulation of complete n-universal transformations, and construct 
explicit examples that asymptotically tend to these bounds. 

Finally, in Section U we show that there is no universal transformation that updates all the 
registers in parallel; however, we construct a complete n-universal transformation that updates all but 
one register in parallel. The first result shows that some asynchronism is required in order to obtain 
universality; conversely, the second result shows that the least amount of asynchronism is enough to 
obtain universality. 

Our work in this paper differentiates in several aspects from results on universal cellular automata 
(see e.g. na IS]). One of these main differences is that simulation on cellular automata usually 
assumes an initial configuration of the input states. On the other hand, our work requires no ini¬ 
tial configuration: an n-universal transformation is able to simulate every transformation using any 
configuration of the states of as inputs. This is illustrated by the results in Section H especially 
in the complete n-universal transformation that updates all but one register in parallel. Indeed, this 
transformation only uses asynchronism to reset a counter, i.e. to place the state in a special initial 
configuration; once this is done, the parallel updates are then complete n-universal. 


2 Simulation of transformations 


Denote [n] := {!,...,n}. For m > n, let pr[„] : A'^ — A^ be the [n]-projection of A^ to A'^, i.e., 
(xi,..., Xm)W[n] = (xi, • • • , Xn). For any f : A^ ^ A^, define 

Sf := < Tran(A”^), 

where : A^ — A^ is the instruction induced by the coordinate function fp. 

: Xi ^ {x)fi. 


Definition 3 (Simulation). Let m > n > 2. We say that / : A^ A^ simulates g : A^ — A^ if 
there exists h £ Sj such that prj„] o g = h o pr^^j. The time of simulation, denoted by tf{g), is the 
memoryless complexity of h with respect to 

Compare our previous dehnition of simulation with the definition of simulation by projection for 
finite state-homogeneous automata networks that appears in m p. 208]. 

Definition 4 (n-Universal). Let m > n > 2. A transformation / : A"^ A"* is called n-universal 

of size m if it may simulate any transformation in Tran(A"'). The time of / is := inax{tf{g) : g € 
Tran(A”)}. 


Example 2. For any n > 2, there is an elementary example of an n-universal transformation / : 
A™ —>■ A™, with size m = 2n -|- In order to describe it, we first let Q := and enumerate by 

P^^\ 1 < s < Q^, all the transformations in Tran(A”'). For 1 < f < n, define the coordinate functions 
fi of / as follows: 


ix)fi 


(a^)pi'[2n]\[n] if X2n-i-s / X2n+Q^-i-s and X2n-i-r 

Xi otherwise. 


X2n+Q^+r (V(l <r < Q^),r 7 ^ s) 
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The rest of the coordinate functions of / are: 


II 

1 

S 

{n + 1 <r < 2n) 

{x)fj — 

{2n + l < j <2n + Q^) 

{x)fk = Xk + f, 

{2n + Q'^ + l<k<2n + 2Q” 


The main idea in this transformation is to use switches, each one corresponding to a possible 
value of s (and hence describing the transformation that we are simulating) and consisting of two 
registers: 2n + s and 2n + + s. 

To show that / is indeed an n-universal transformation, let be the instruction induced by 
the coordinate function /j. Suppose that we want to simulate 1 < s < . Then, this may be 

achieved as follows. 

Step 1. Make a copy the first n registers: o • • • o 

Step 2. Turn all switches off: o ■ ■ ■ o ir(2n+Q")_ 

Step 3. Turn the right switch on: 

Step 4. Compute o ■ ■ ■ o F^'^\ 

Or more concisely, the transformation 

h := o ■ ■ ■ o o o • • • o o i?{2n+(9’*+s) ^ q ... q 

satisfies prj„] = h o prj^j. As such, the time of / is tj = n + Q” + 1 + n = + 0{n). 

In the following sections, we study n-universal transformations with minimal size and time. 


2.1 Universal transformations of minimal size 

In this section, we denote the transposition of u,v £ A” as {u,v), where, for any x € 


ix){u,v) = < 


V if X = u 
u if X = r 
X otherwise, 

V ’ 


and the assignment of u to r as (n —^ r), where 

{x){u v) = 



if X = u 

otherwise. 


For any / € Tran(A"') and g £ Sym(A”), the conjugation of / by g is := g~^fg £ Tran(A”'). 

It was determined in [TO] that, unless |A| = n = 2, there exists a set T C X of size n that generates 
the the whole symmetric group Sym(A"'); hence, the set yu{(e^ —>■ e^)} of n + 1 instructions suffice to 
generate the full transformation semigroup Tran(A”). In the following theorem, we prove there is no 
set of n instructions that generate Tran(A"^), which implies that there is no n-universal transformation 
of size n. 


Theorem 1. For any n>2, there is no transformation f £ Tran(A'^) such that Sing(A”) < Sj. 

Proof. Suppose that Y := {tA), ..., is a set of instructions that generate a semigroup con¬ 

taining all singular transformations, where F^^'i updates the z-th register. Since the composition of 
permutations is a permutation, at least one of these generating instructions must be singular. 

First, assume that at least two instructions of Y, say TA) and F^‘^\ are singular. We claim that no 
assignment g = {a ^ b), with a* ^ bi, i = 1, 2, can be computed using only instructions in Y. Indeed, 
suppose that tA) is the first singular instruction in a program computing g, so g = tt o tA) o h, for 
some h £ Tran(A"^) and vr E {F^^'>, ... , As ttoFA) is singular, there exist u,v £ A”, v, such 
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that {u)Tr o = (v) TT o which implies that {u)g = {v)g. However, as tt o F^^'> does not update 

the second register, we have {u,v} ^ {a,b}, which contradicts the definition of the assignment g. 

By the previous paragraph, there may be only one singular instruction in Y, say F^^'^. Let u,v £ A^, 
ui 7^ ui, be such that {u)F^^'^ = {v)F^^\ For any g G Sing(yl”), we may write g = tt o F^^'^ o h, where 
h G Tran( 74 ”) and tt G ..., C Sym(yl”^). Letting x = {u)tt~^ and y = (u) 7 r“^, we see that 

xi / yi and {x)g = {y)g. However, this means that assignments such as g = {a ^ b), with a ^ b, 
ai = bi, cannot be computed. □ 

Corollary 1. For any n >2, there is no n-universal transformation of size n. 

It is an open question whether there is an n-universal transformation of size n -|- 1. However, we 
shall find an n-universal transformation of size n + 2 and time at most 6 [log 2 (( 7 )] {q — + 0{q^). 

Before this, we need the following result of memoryless computation. 

Theorem 2. Let |A| = q and n > 2. Then Tran(yl”) is generated by a set of instructions Y, containing 
at most q instructions per register, such that any transformation of has memoryless complexity 
with respect to Y of at most 3 |'log 2 ((?)] (<? — l)nq'^~^ + 0{q'^). 

Proof. We consider the following instructions; 


j'W ■ 

xi ^ xi -|- S{x, e°) — S{x, e^). 


A(2) : 

X 2 X 2 + 6{x,e^), 


/«: 

xi ^ xi + 1 — 6{x, -|- 6{x, {q — l)e^). 


/(b . 

Xi ^ Xi + l - 5{x, Ae*), 

AeA 

(for 2 < i < n 


where 5{x,y) denotes the Kronecker delta function, and Ae* is the state with A G A in its i-th register 
and zero elsewhere. In order to simplify notation, we shall identify x G A"^ with its lexicographic 
index Yji=i ^ {0; 1; • • • > 9*^ “ !}• With this, we may write A^"^^ = (0 —> g) and = (0,1). 

Observe that the instructions are permutations with the following cyclic structure: consists 

of one cycle of length q — 1 and — 1 cycles of length q, while, for 2 < i < n, the instruction 
consist of just q^~^ — 1 cycles of length q. 

Let p := [log 2 (q')] and define 

Y := |r(i), g 1(2), ; 1 < i < n, 0 < j < p - l| . 

We shall follow several steps in order to prove that Y is the required generating set. 

(i) Any transposition := {0,k), with k G A^, has memoryless complexity with respect to Y of 
at most pw{k) + 0 ( 1 ), where w{h) is the number of non-zero coordinates of k. 


Proof. First, we determine the memoryless complexity of for 1 < i < n and 1 < A < (7 — 1 

with respect to Y. Using the binary expansion A = £ {0,1}, it is clear that 

(jW)A ^ (j(i))Ai o ((/(d)2)Ai ^ ^ 

Thus, we need at most p instructions from Y to compute 

Fix k G A^, and suppose that 1 < ji,... ,jw < n, with w = w{k), are the non-zero coordinates 
of k. If k is not a multiple of q (i.e. ji = 1), we have 

rW:=(o,fe) = (r«)^ 

while if /c is a multiple of q, we have 

The result follows because ^ for any 1 < A < g — 1 and 2 < i < n. □ 
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(ii) Any permutation in Tran(A"') has memoryless complexity with respect to V of at most 2p(q — 
+ 0{q^). 

Proof. Note that any transposition (a, b) may be expressed as 

(a, 6) = 

Since any permutation with r non-fixed points may be expressed as at most r — 1 transpositions, 
cyclic permutations of length q^ have the maximum memoryless complexity. In particular, if 
TT = (oi, 02 ,..., Oq-n.) € Sym(A'^), then 

TT = (oi, 02) . . . (Oq^-l, Oqn) 

— (^J'{a2)'J'{a.-l)rp[a3)rp[a2)'\^ (J'io.qn-l)rp{aqn _ 2 )'J-'{aqn) rj-i{aqn _i)'^ 

In this decomposition, and appear once, while every other transposition ^ 

s 0 {l,g”}, appears twice. By step (i), requires at most pw{as) + 0(1) instructions from 
Y. Since 


w{k) = ^i(g- 1)* 

k&A^ 


i=l 


n 


. = (g - l)ng' 


n—1 


( 1 ) 


it takes at most 


q^-l 


2 Y ipw{as) + 0(1)) < 2p{q - l)ng" ^ + 0(g”) 


s=2 


instructions from Y to compute vr. 


□ 


(iii) Any transformation in Tran(A”') has memoryless complexity with respect to Y of at most 3p{q — 
l)nq^-^ + 0{q^). 

Proof. Let g be any transformation of rank r < g"'. Consider the partition ker( 5 f) := {Pi,..., P^} 
of A” induced by the following equivalence relation: a 6 if and only if {a)g = {b)g. (This 
equivalence relation is called the kernel of g). For 1 < i < r, let Pj = {pip,... ,pi^ni}- Depending 
on two cases, we shall find a transformation h such that ker(g) = ker(h), which implies that 
g = ho TT for some vr € Sym(A"'). 

Case 1: States 0 and q are in a same set of ker(g). Without loss, assume pip = 0 and pip = q. 
Then, define 

h := A^‘^\q,p2 i)r(P2. 2 )^( 2 ) ^ ^ ^ j^{p r,nr )^(2). 

Case 2: States 0 and q are in distinct sets of ker(( 7 ). Without loss, assume piq = 0 andp^^^^ = q. 
Then, define 

h := (0,g)r(Pi’2)^(2) ^. .r(Pi."i)A(2)(g,p2,i)F(?’2.2)yi(2) ^ ,,T^Pr,^r-i)A(‘^\pj^2,q), 

where j is the smallest index for which pjp exists. (Clearly, such an index j always exists 
because g does not have full rank.) 

Each transposition in h takes at most pw{pij) + 0(1) instructions and each assignment takes 
0(1) instructions. The result follows by equation ([T]) and step (ii). □ 


□ 


Theorem 3. There exists an n-universal transformation of size n + 2 and time at most 

6riog2(g)l(g- l)ng’"“^ + 0(g’"). 
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Proof. Let p := [log 2 (g)]. We consider the generating set of instructions Y given in the proof of 
Theorem [2j For each instruction in Y, we denote the corresponding nontrivial coordinate function in 
lowercase, e.g., the nontrivial coordinate function of ^ is {x)i1 = xi+2 — 26{x, e^)+26{x, {q—l)e^). 
Consider the transformation / € Tran with coordinate functions defined as follows: 


{x)h 


/ 

(x)pr[„] O ii if Xn+l - Xn+2 = 0 

(x)pr[„] o ij if Xn+l - Xn+2 = 1 


(a:)pr[„] o if ^ if Xn+i - Xn+2 = P - 1 
_ (x)pr[„] O ti if Xn+l - Xn+2 = P, 


{x)f2 


/ 

(x)pr[„] o Z 2 
(x)pr[„] o i\ 

(x)pr[r,] O if 

, (a^)pr[n] o “2 


if Xn+l Xn+2 — 0 
if Xn+l Xn+2 — 1 


if Xn+l Xn+2 — P 1 
if Xn+l Xn+2 — P; 


{x)fj 


/ 

(x)pr[„] O ij if Xn+l - Xn+2 = 0 

(x)pr[„] O i2 if Xn+l - Xn+2 = 1 


(x)pr[„] O if ^ if Xn+l - Xn+2 = P - 1 
Xj if Xn+l Xn+2 — P; 


(3 < j < n), 


(.^')fn+l — Xn+2^ 


{x)fn +2 = Xn +2 + 1- 


The main idea behind the definition of / is that the additional registers work as a switch to decide 
which instruction the program shall use. 

Let € Tran(A"'+^) be the instruction induced by the coordinate function /j. For any g G 
we may now find h ^ Sf = such that pr[„] o g = h o pr[„]. Suppose that g = 

giP o o • • • o g^^\ where g^^^ G Y. By grouping together the powers of we may assume that 
gik) ^ Y U {(/(•^))^ : 1 < A < g — 1}, so £ < 3{q — l)nq^~^ + 0{q^). Denote A = Yli=i with 

Xi G {0,1}. Let and for each 1 < A: < £, let 




' (_p{^*+2) y _p(l) jrin+l) gik) _ j^(l) 

, ('_p{n+2)^p_p(2)^(n+l) if 

(^p'^3)yiYin+2)^jr{j)y2 _ _ jr{n+2)^p{j)ypp{n+i) [f giF _ 


Therefore, we may take h = ■ ■oh^^\ which uses at most 2pl instructions from {F^^\ ..., 

This shows that / is an n-universal transformation of size n + 2 and time 6p{q — + 0{q^). □ 

Remark 1. For g = 3 or q = 5, there is a simpler n-universal transformation of size n -|- 2 whose time 
is strictly less than the time of the n-universal transformation / constructed in the proof of Theorem 
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[3l Defining 


{x)fi = 1 

f (a:)pr[n] o ii 
[ (3;)pr[n] o ti 

if — ^nH-2 

if 7^ ^nH-2; 

ix)f2 = 1 

\ (a;)pr[n] o ^2 
[ (a;)pr[n] o “2 

if ^n+1 — ^n+2 
if ^n+1 7^ ^n+2 

{x)fj = (x)pr[^] oij, 


(^)/n+l — ^n+2? 


(x)/n +2 = 1 

f Xn+2 if 

[xn+2 + 1 if 

^n+1 7^ ^n+2 

^n+1 — ^n+2? 


we obtain an n-universal transformation / of size n + 2 and time tj{n) = 3{q — l)ng”' + 0{q^). 

Observe that, for g = 7 or > 9, we have t^(n) > tj(n), while, for q € {2,4, 6,8}, tj{n) = tf{n). 

However, for g = 3 or g = 5, we have tj{n) < tj(n); Table [U compares explicitly the times of / and /. 



t^-(n) t/(n) 

g = 3 

g = 5 

6n3^ + 0(3”) 8n3” + 0(3”) 

60n5”-i -h 0(5”) 72n5”-i -F 0(5”) 


Table 1: Times of / and /. 


2.2 Universal transformations with minimal time 


In the next theorem, we construct an n-universal transformation that, instead of having switches 
consisting of two registers (as in Example [2] or Theorem [3]), it has a switch consisting of a one-error 
correcting code. 

Theorem 4. For any n >2, there exists an n-universal transformation with time g"' -|- 0{n). 


Proof. We shall construct an n-universal transformation / with the required time. Foremost, we need 
the following notation. Let A = Zq and p := [log 2 gj. We then consider the following functions: 


odd ; H ^ GF(2) 


(a)odd 


1 if a = 1 mod 2 
0 if a = 0 mod 2, 


err : A ^ A 

,, |a-|-l ifo<g — 1 

(a)err = { 

la — 1 it a = g — 1. 

By applying them component-wise, we extend these functions to odd ; A^ —>■ GF(2)^ and err : A^ —>■ 
A^, for any k>\. We also identify any v G GF(2)^ with its lexicographic index X]f=i 





Moreover, let Q := and h := Q + [log2<3] + 1, and consider the (n, Q, 3)-shortened Hamming 
code C in systematic form (see M)- Let M G GF(2)‘3^”' be the generator matrix of C, and, for LT C [h], 
denote by Mh the matrix formed with the H-th columns of M. As C is a one-error correcting code, 
let 


dec : GF(2r ^ {0,...,h} 

, . , \ j if t; = c -|- e-^ for some c G C 

(njdec = < 

0 otherwise. 


Let {A^)^ = ... ,7('3)} be the set of all coordinate functions A"’ —>■ A, and let r := 

[([log2 Q\ +1)/p\. The n-universal transformation / G Tran(A™'), with m = 2n + Q + r, is given as 


follows: 


f Xi if (x)pr[^]\[2n] odd dec > Q, 

\ otherwise, 

(1 < i < n). 

{x)fj = Xj-n, 

(n -|- 1 < j < 2n), 

{x)fk = (xfc)err, 

(2n + 1 <k <2n-\- Q), 

{x)fl = ((a;)pr[2n+(3]\[2n] Odd^ G[Q+/p]\[Q+(;_i)p], 

(2n + Q-\-l<l<2n-\-Q + r 


The different parts of the transformation may be intuitively explained as follows: 

• registers n -|- 1 to 2n maintain a copy of the original configuration of registers 1 to n; 

• instructions , ■ ■ ■, are used to describe the coordinate function gi, where each register 

k from 2n -|- 1 to 2n -|- Q holds the bit (xfc)odd while each register from 2n -|- Q -|- 1 to m holds p 
bits; 

• registers 1 to n then have access to their original configuration and to the coordinate function 
they need to simulate. 

The transformation g = \ ..., G Tran(A"’) is simulated as follows: 

Step 1. Make a copy of the first n registers: o • • • o 

Step 2. Encode (x)pr[2„ +Q]\[2n] iiito a codeword of C: o • • • o pC^n+Q+r)^ 

Step 3. For i from 1 to n do: 

Step 3.1. Add an error for _ 

Step 3.2. Compute 7*^^'^: F^*). 

Step 3.3. Remove the error: 

The time is then given by 

n -|- r -|- 3n = ^ -|- 0(n) = + 0{n). 


Conjecture 1. Any n-universal transformation has time at least . 

Provided Conjecture [T] holds. Theorem 0] indicates that the minimum time of an n-universal 
transformation is t(n) = q^ + 0{n). As such, Theorem [3] indicates that we can achieve a time of 
0(t(n) logt(n)) with only two additional registers. 
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3 Sequential simulation of transformations 

Definition 5 (Sequential simulation). Let m > n > 2. We say that / : sequentially 

simulates € Tran(A”) if there exist ..., & Sf C Tran(^™') such that, for any 

l<i<£, 

Pr[n] o 9 ^'^ = o ■ ■ ■ o ° Pr[n]- 

The sequential time of simulation, denoted by stf{g^^\...,g^^')), is the memoryless complexity of 
o ■ ■ ■ o with respect to 

Definition 6 (Complete n-nniversal). A transformation of A™' is called complete n-universal if it may 
sequentially simulate any finite sequence in Tran(A’^). 

Define the maximal and minimal sequential times of f, denoted by st™*^^ and st™“, respectively, 
as follows: 

st““ = max|st/(5f(^\... for i / j| , 

st““ = min I St/(5^^^ ... + 9 ^^^ for i / j} • 

As the sequences considered in the above definitions must include each transformation in Tran(A”') 
exactly once, the relevant aspect when calculating maximal and minimal seqnential times is the order 
in which the transformations of A^ appear in the sequence. 

Example 3. The n-universal transformation of Example [5] is in fact a complete n-universal transfor¬ 
mation. Let g^^^ = = p^^^\ then this sequence can be simulated as follows. 

Step 1. Make a copy of the first n registers: o ■ ■ ■ o 

Step 2. Turn all switches off: o • • • o 

Step 3. For j from 1 to £ do: 

Step 3.1. Turn the switch on: 

Step 3.2. Compute F^^^ o ■ ■ ■ o F^^\ 

Step 3.3. Turn the switch off: 

The maximal sequential time of / then satisfies 

st-^< (n + 3 + o(l))Q-. 

By considering a sequence where and g^^'^ only differ by one coordinate function (i.e. a Gray code, 

which we shall use later on). Step 3.2 can be simplified by updating only the register corresponding 
to the differing coordinate function. The minimal sequential time of / then satisfies 

stf"< (4 + o(l))Q". 


3.1 Complete universal transformations of minimal size 
Theorem 5. The size of a complete n-universal transformation is at least 2n. 


Proof. Let / be a complete n-universal transformation of size m > n. Identify the elements x of A"" 
with their lexicographic index consider the sequence g^^^ , g^^^ , ■ ■ ■, G Tran(A”) 

defined by 


(x)g(^^ 


1 if X = k, 

0 otherwise. 


By definition of complete n-universal, there is a sequence ..., G Sf such that, for any 

0<i<q^ -1, 


Pr[n] o 9 ^'^ = o • • • o ° Pr[n]. 
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We claim that the rank of —)• is at least g". Indeed, if 0 < /ci, A;2 < g"" — 1 satisfy 

{ki)h^^"> = {k 2 )h^^\ then 

{ki)h^^^ o o • • • o o pr[„] = {k2)h^^^ o o ■ ■ ■ o o pr[„], 

l = (fei)ff("i) = {k2)g^’^^l 

Therefore, ki = k 2 , which means that all the states of A^ have distinct images under Since 

{A^)g^^"> = {0,1}, and 0 is the unique state with image 1 under g^^\ we have 




> g" - 1. 


This implies that (^™')pr[^]y„] has at least g” — 1 states, which is only possible when m > 2n. □ 

Theorem 6. Let \A\ = q > 2. Then, there exists a complete n-universal transformation f of size m 
and sty^^{n) as given in Tahle\^ 



m st™'^^(n) 

q = 2 

q = 3 or q = b 

g = 4 or (7 > 6 

2n + 3 3n2'^2"+n ^ q^2^2^+^) 

2n + 2 6{q - 

2n + 2 6[log2(g)l {q - + 0{q^<i"+^) 


Table 2: Complete n-universal transformations. 


Proof. Assume first that g = 4 or g > 6, and let /? := [log2 q] ■ Consider the n-universal transformation 
/ of size n + 2 constructed in the proof of Theorem [3l Now, define the coordinate functions of / by 


{x)fi 


Xn+3+i if ^n+1 Xji^2 — P ~k ^ 

(a^)pr[n+2] o fi otherwise. 


(1 < z < n). 


{x'lfn+l — Xji+2j 
{x)fn+2 = Xn+2 + 1, 

{x)fj = Xj-n- 2 , {n + 3 < j <2n + 2). 

Intuitively, registers n -|- 3 to 2n -|- 2 maintain a copy of the original configuration of registers 1 to n; 
again, registers n -|- 1 and n + 2 indicate which coordinate function to use but now the position p + 1 
indicates that /i, 1 < i < n, must copy back the original values of the input from registers n -|- 3 to 
2n -|- 2. 

Let and be the instructions induced by the coordinate functions fi and fi, respectively. 
Suppose that we want to sequentially simulate g^^\ ... ,g^^^ G Tran(A"'). Since / is n-universal, there 
exist h^^\ ..., € Sf = ,..., such that pr[„] o 51W = o prj^]. For 1 < z < ^, define 

Sj hy replacing every instruction F^^'i in /ib) by F^^f Let 

(j p(n+3) p(2n+2) ^ ('^(»^+2)^p+l (-^(1) _ _ _p(n.)^^(n+l) ^ 

Then, for every 1 < z < £, we have 

pr[„] o o • • • o opr[„]. 


II 







By Theorem [3l each has memoryless complexity of at most Qp{q — + 0{q^). Hence, 

sequences of length I = have maximal sequential time of ^p{q — 

For g = 3 or g = 5, let p := 1, and use the above construction of / with /, as in Remark [H instead 
of /. 

The proof for g = 2 is very similar. The main difference is that, as the first and second coordinate 
functions must choose amongst three possibilities (ii, ti, or x„+3, and 12 , 02, or Xn+A, respectively), 
a switch consisting of two registers does not suffice; however, a switch of three registers is enough for 
our purposes. More formally, we now define the transformation / E Tran(H^®®'’'^) by 


{x)ii 


^n+3+ii 

{^)W[n+2] O fi 


if Xn-\-2 7^ ^n+3 
otherwise, 


(1 < i < n). 


{x)fn+i 


^n+2? 


{x)fn+2 


Xn+2 + I5 


{x)fn+3 


^n+2 5 


{x)fj = Xj-n- 3 , (n + 4 < j < 2n + 3). 

Using a similar notation as above, define 

(J pin+i) p{2n+3) p pin+3) ^p{l) p{n) p{n+3) ^ 

For 1 < i < i, define E S'j by replacing every instruction in /j,(®) by pi^^) for A: < n + 1 and by 
replacing every instruction in /j(*) by _p(’^+2)^{«+3)^ Then, for every 1 < i < £, we have 

pr[n] o 5(1) o • • • o ... (R/i(®)) opr[„]. 

The time analysis is similar as before. □ 


3.2 Complete universal transformations with minimal sequential times 


Theorem 7. Let f be a complete n-universal transformation. Then, < st™‘“. Conversely, there 
exists a complete n-universal transformation f such that st™‘“ = (1 + o(l))g’®'?". 


Clearly, one always needs at least g'®®'^" updates to compute any sequence of length q '^'^", so g®®”?" < 
st™®®. In order to prove the upper bound in the theorem, we need several preliminary results about 
Gray codes. 

As usual, let |A| = g and n > 2. An (n, q)-Gray code is an ordering ..., of the states 

in A®® such that two consecutive states differ by only one coordinate: dH{a^^~^ mod ^ 

all 0 < i < g®® — 1, where dn is the Hamming distance. For any Gray code G = {a^^\ ... 
we denote the sequence C{G) = (c^^\... E where c^®^ E [n] is the coordinate in which 

a(®-i mod?®®) and a^®^ differ. A run of length I for G is a sequence c^®^..., of consecutive 

distinct elements of C{G). We say that G has r{G) runs if C{G) can be partitioned into r{G) runs. 
For instance, the canonical (n, 2)-Gray code has 2®®“^ runs. For n = 2, we have 


a(0) = 00, = 01, = 11, = 10 

c(0) = l, cW=2, c(2) = l, c(3)=2. 


For n = 3, we have 






o( 0 ) = 000, = 001, o( 2 ) = oil, a( 3 ) = 010, = 110, = 111, a ^^'> = 101, = 100, 

c(0) = 1, = 3, c(2) = 2, c® = 3, = 1, = 3, = 2, = 3. 
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Clearly, any Gray code has at least (f’jn runs; we shall construct (n,g)-Gray codes with o(g"') runs 
for even q. 

Lemma 1. For any n a power of 2, there exists an {n,2)-Gray code with o(2"') runs. 

Proof. We shall prove the result by induction on n. The code G 2 is the canonical Gray code. Suppose 
Gn = (o®, • • •, (or simply written, 0 up to 2 ” — 1 ), then G 2 n is given by 

(( 0 , 0 ), ( 0 , 1 ), ( 1 , 1 ), ( 1 , 2 ),... , ( 2 - - 1 , 2 " - 1 ), ( 2 ^ - 1 , 0 ), 

{2"- - 2,0), {2"- - 2,1),... , (2” - 3,2” - 1), (2” - 3, 0), 

( 2 , 0 ),( 2 , 1 ),...,( 1 , 2 ”- 1 ),( 1 , 0 )). 

There are 2^~^ rows, each containing 2^~^^ elements. The code G 4 is then 

G 4 = (^ 0000 , 0001 , 0101 , 0111 , nil, iiio, loio, looo, 

1100 , 1101 , 1001 , 1011 , 0011 , 0010 , 0110 , 0100 ), 

C{G 4 ) = (2,4,2,3,1,4,2,3,2,4,2,3,1,4,2,3), 

which can be partitioned into six runs (instead of eight for the canonical code). 

Let 'L(n, d) denote the set of indices i such that the next occurrence of appears at least d 
indices later. More formally, let T^ = (V, E) be the directed graph on V = {0,..., 2"' — 1} with arcs 
E = {(i, i + 1 mod 2” — 1) : i G 1/} and let d{i,j) be the length of the path from i to j in r„, then 

T(n,d) = {i : 0 < d{i,j) < d 

For any d, the Gray code Gn has at most 

|T(n,d)|/d + 2 ( 2 ”-|T(n,d)|) + l 

runs. Indeed, split 'I'(n, d) into sequences si,..., Sm of consecutive indices, where m < 2"' —|'I'(n, d)|+l. 
Each sequence st of length h <t <m) can be partitioned into \lt/d~\ <lt/d+l runs, thus requiring 
at most |'I'(n,d)|/d + 2” — |T(n,d)| + 1 runs to partition ^( 71 ,d). Moreover, the indices outside of 
'I'(n, d) can be partitioned into singleton runs; altogether, this yields |'I'(n, d)|/d + 2(2"'— |'I'(n, d)|) +1 
runs. 

Our strategy is then to find a distance d such that d = w(l) and 2"' — |'I'(n,d)| = 0 ( 2 ”). We have 
|T(n, 2)1 = 2” for all n and by construction, 

|T(2n,2d)| > 2’"|T(n,d)| -2’"d, 

since the only ones that do not follow the simply doubling pattern are the ones at the end of every 
row. Denoting the largest power of two less than or equal to log 2 n as I, we then obtain 

|^'(n,/)| > 2 G^\'I>{n/ 2 ,l/ 2 )\ - 2 ^Gi 

> 23’^/^|T(n/4,//4)| - ^ 

> 2"-2’"/'|T(2n//, 2)1 - + ... + 2 ^G) 

= 2" - o{ 2 '^). 


□ 


Lemma 2. For any n, there exists an {n,2)-Gray code with o(2"') runs. 
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Proof. This is obtained by the usual “product” construction of Gray codes. Let m be the largest 
power of two less than or equal to n. Denote the (m, 2)-Gray code from Lemma[T]by (0,1,... , 2"* — 1) 
and an (n — m, 2)-Gray code by (0,1,, 2”“"* — 1). Now, construct an (n, 2)-Gray code as follows: 

((0,0),...,(0,2™-l), 

(1,0),...,(1,2™-1), 

{2^-m -1,0),..., (2"-”" - 1, 2™ - 1)) . 

This has at most 2"'“™' ■ o(2™') = o(2"') runs. □ 

Lemma 3. For any even q and any n, there exists an {n,q)-Gray code with o{q^) runs. 

Proof. Here again, we use a “product” construction, viewing each element of [<?]’^(= A^) as an element 
of [p]"" X [2]”, where p = |. We then combine any (n,p)-Gray code with the (n,2)-Gray code from 
Lemma [2] as follows: 

((0,0),...,(0,2--l), 

(1,0),...,(1,2--1), 

(p--l,0),...,(p"-l,2--l)). 

Clearly, this has at most p” • o(2”) = o{q^) runs. □ 

For odd q, we do not use a Gray code. Instead, an (n, q)-pseudo-Gray code of length L is a sequence 
P = (p^^^,... of elements of [g]" such that every element of [q]'^ appears in the sequence and 

any two consecutive elements of the sequence only differ by one coordinate. (A pseudo-Gray code is 
a Gray code if every element appears exactly once.) Runs are defined for pseudo-Gray codes in the 
natural way and the number of runs is still denoted r(P); the redundancy R{P) of a pseudo-Gray 
code is R{P) = r{P) -\- L — q'^. 

Lemma 4. For any q and any n, there exists an {n,q'R)-pseudo-Gray code with redundaney o(g’^'?"). 

Proof. If q is even, we use the Gray code from Lemma El Suppose that q is odd. Then Q := q'R is 
odd, so, again by Lemma El there is an (n, Q — 1)-Gray code G with o{Q'^) runs. We shall construct 
an (n, (5)-pseudo-Gray code by using G first, and then enumerating all of the remaining states in [Q]”. 
It takes at most n steps to go from of these remaining states to another, and there are — {Q — 1)^ 
of them. Thus, the redundancy of this pseudo-Gray code is at most 

2niQ^ - (Q - 1)-) + oiQ^) = 2n{q^^^ - {q^" - 1)^) + o{Q^) 

< 2n • + o(Q^) = o(Q"). 


□ 


Finally, we may prove Theorem [71 

Proof of Theorem We explicitly construct the complete n-universal transformation / of the state¬ 
ment of the theorem. Let Q = q'^" and P = (p*'^^ = id,... ,p^^~^^) be the (n,Q)-pseudo Gray code 
of LemmaEl We use the notation C{P) = ... , 0 ^^“^^), which is partitioned into r = r(P) runs 

Pi = ...,... ,Rr = ...,and 

T : [r] X [n] ^ {0,..., Q” - 1} 

f A if 3A : z = CA G Ps 

{s,i)t = < 

1 0 if i ^ P,. 
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LetG = bea (cj, g)-Gray code, where a = [log„ r] +1. We use the index function 


ind;yl‘^^{0, 

(x)ind = j such that aj = x 

and C'(G') = (cq, ..., Cq<T_i). We denote the successor function for this code as 

S : A'" ^ 

(aj)S = UQ+i) mod(g-) (0 < i < - 1). 

Let m = 2n + a + 2. Then the transformation / is defined as follows: 

(((a:)pi-[2n+i+a]\[2n]i°'i, i)T) 


{x)fi = (x)pr[ 2 „]\[„]Pi 


(1 < i < n), 


{x)fj = Xj-n, 


{n + 1 < j < 2n), 


ix)fk = 


(2^)pr[2n+(T]\[2n] 2n if 1 

1 if Xfn ^ Xm—l} 


{2n + 1 < k <2n + a), 


{x)fm-l = X 


m 5 


{x)fm = Xm + I- 

Intuitively, registers n + 1 to 2n maintain a copy of the original conhguration of registers 1 to n; 
registers 2n + 1 to 2n + cj form a counter indicating the run number in the pseudo-Gray code P; 
registers m — 1 and m form a reset switch for the run counter. 

The program computing P = = id,... in order goes as follows. 

Step 1. Make a copy of the hrst n registers: o • • • o 

Step 2. Reset switch on: pi™ 

Step 3. Reset run counter: p(2"'+i) o • • • o _p(2n+o-)^ 

Step 4. Reset switch off: pi*”). 

Step 5. For s from 1 to r do: 

Step 5.1. Compute topi'’'’“^i: p('^’’('’-i)) o • • • o pi‘^’'s-i); 

Step 5.2. Increment the run counter: pi'^n+cs+i)^ 

Total time: 

r 

n + l + a + l + ^(^5 - Ts-i + 1) = L + r + O(logr) = Q” + o((5"'). 

S=1 

We finally prove that this transformation is complete. Let G Tran(^”') be a sequence of 

transformations. It is clear that applying Step 1 and then repeating i times Steps 2 to 5 will simulate 
i times the full sequence p^^\... ,p^^~^\ As such, is simulated during the first iteration, 
during the next, and so on until □ 

Theorem 8. Let f be a complete n-universal transformation. Then, Conversely, 

there exists a complete n-universal transformation f sueh that = (n -|- I -|- o(l))g"''?". 
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Proof. Viewing any coordinate function —> A as an element in Zg, with Q = > 4, we give an 

ordering of Tran(74”) = Zq such that any two consecutive transformations differ in all n coordinate 
functions: 

(Q - 1 ,..., Q - 1 , 0 ), ( 0 ,... , 0 , 1), ..., (g - 2 ,..., Q - 2 , Q - 1)) . 

Clearly, the time of simulation of such a sequence of transformations is at least so ng”'^" < 

The construction of / is similar to that of Theorem 0| it is still based on Hamming codes, except 
that it describes a whole sequence of transformation of at once. Let p := [log 2 gj • We shall 
re-use the notation of odd and error introduced in Theorem SI Moreover, for k := g”*^ and h ■.= k + 
[log 2 -|- 1, consider the (h, k, 3)-shortened Hamming code C in systematic form. Let M € GF(2)^^”' 
be its generator matrix and, for H C [h], let Mh be the matrix formed with the H-th columns of M. 
This is a one-error correcting code, so let 

dec : GF(2)” ^ {0,...,h} 

\ j if u = c -b e-^ for some c G C 
(ujdec = < 

I 0 otherwise. 

We denote (Tran(H”'))‘3 = {F^^^ ... where F^-^^ = ■ ■ ■,and r = |'(|'log 2 k'\ + 

l)/p~\ . Let m := 2n + k + r + + 2. Let {x)l € {0,..., g - 1} be the index of (x)prj^_ 2 ]y[ 2 n+fe+r] 

according to the (g”, g)-Gray code G = (a®,..., and let C{G) = The index 

{x)l shall work as an counter to decide which transformation will be simulated. For x G A^, define 
(x)r := (x)prj 2 ,^^j,_,_,,j^[ 2 n]°'^‘^'^®^' complete n-universal transformation / is given as follows: 


(.x)fi = j 

f Xi, if {x)t > k, 

[ (a;)pr[2n]\[n] o , Otherwise, 

(1 < z < n), 

fi 

1 

II 

(n -b 1 < j < 2n), 

{x)fk = (a;fc)err, 

(2n -b 1 < A: < 2n -b fe), 

{x)fi = ( 

J^)pr[2n+fc]\[2n] ^dd) , 

(2n -b -b 1 < Z < 2n -b -b r), 

(.x)fs = j 

r ((a:)Z+l mod(Q)) -r _ 

1 — ^m—lf 

1^0 if Xm ^ Xttx—I, 

(2n -bfc-br-bl<s<m — 2), 


1 — Xmi 

T !• 

Let A > 1 and A = gA. The sequence gf length A is 

simulated as follows: 
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Step 1. Make a copy of the first n registers: o • • • o 

Step 2. Encode (a:)prj 2 „_|_^jy 2 n] ^ codeword of C: ir( 2 n+fc+i) g ... o ^( 2 n+fc+r)_ 

Step 3. Reset the counter {x)l\ o o ^(2’"+^+^+^) o • • • o ^(”^-2) o 

Step 4. For /i from 1 to A do: 

Step 4.1. Add an error to f(2’^+*^‘). 

Step 4.2. For a from 0 to Q — 1 do: 

Step 4.2.1. Compute F^^^ o • • • o F^”'^ 

Step 4.2.2. Increment the counter {x)l according to the Gray code: F^2’^+fc+^+c^‘^+^')^ 
Step 4.3. Remove the error: f(2"-+v). 

For A = Q^, the time to simulate this sequence is then given by 

n + r + + 3) + ^{Q{n + 1) + 2) = (n + 1 + o(l))Q’*. 


□ 

Note that the complete n-universal transformation / of size 2n + 2 (or 2n + 3 when q = 2) 
given in Theorem El does not have a very high sequential time compared with complete n-universal 
transformations of minimal sequential time; indeed, we may see that st™®'^ is equal 0(st™’^^ logst™*™), 
with / as in Theorem [8l 

4 Simulation of transformations in parallel 

So far, we have looked at sequential updates (i.e. one register at a time). This is a strong constraint 
for MC: if we were allowed to update all registers at once, then any function could be computed in 
only one time step. However, in our model of universal simulation, this is actually a strength and a 
necessity. 

We extend our framework to consider the following type of simulations. 

Definition 7 (Parallel simulation). Let m > n > 1. We say that / : A™' —>■ A™' simulates in parallel 
g : A"' —>■ A"' if there exists h € (/) such that pr^^] o g = ho pr^^]. 

We also consider the slightest form of asynchronism in sequential simulations. For / G A™ —>■ A™, 
define : A"^ ^ A"^ by 

(x)f([”"“^]) := {{x)fi,{x)f2,...,ix)fm-i,x„,). 

Definition 8 (Quasi-parallel sequential simulation). Let m > n > 1. We say that / : A™' — >■ 
A™ sequentially simulates in quasi-parallel g^^\ ..., g^^') G Tran(A"') if there exist G 

i?M) such that, for all 1 < f < ^, 

Pr[n] o 9 ^'^ = o PI'[n]> 

and the instruction F^™^ appears at most once in the program o • • • o . 

Say that / : A™ —A™ is a complete quasi-parallel n-universal transformation if it may sequentially 
simulate in quasi-parallel any finite sequence in Tran(A”). 

Theorem 9. For any m > n > 1 , there is no transformation f : A™' — A™ that may simulate in 
parallel every transformation in Tran(A"'). However, for any n > 1, there exists a complete quasi¬ 
parallel n-universal transformation. 
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Proof. Suppose that / : may simulate in parallel every transformation in Tran(^”). For 

a,b ^ A^, a ^ b, consider the constant transformations G Tran(A”) defined by {x)g^'^^ = a 

and {x)g^^^ = b, for all X G A^. Then, by definition of parallel simulation, there exist integers ka < kb 
such that, for all x G A^, 

o pr[„] = (x)pr[„] o 5^“) = a and {x)f^>’ o pr[„] = (x)pr[„] o = b. 

But now we obtain a contradiction: 

b = {x)f>‘» o pr[„] = f^-^ o pr[„] = a. 

Let \A\ = Q = and denote Tran(74”) = ... ,p(*3"+i) = moreover, let p®, and 

p(c) for c > + 2 all be equal to the identity. Let m = (Q” + l)n + q^ + 3. Now we shall construct a 

complete quasi-parallel n-universal transformation /. Let C G {0,..., qQ^ — 1} be the lexicographic in¬ 
dex of (x)pr[(Q„_|_2)„_|_qn_|_r]\[(Qn_,_2)„]. We further denote fc ■= (/(Qn+2)n-i-i) • • •) f{Q'^+2)n+q^+i)- Define 
/by: 

{x)fi = (a;)pr[(c+i)„]\[Cn]Pr^ (1 < f < n), 

{x)fj=Xj-n, (n-Fl < j < ((5” + 2)n), 


{x)fc = 


{C + 1) mod (Q” -I- 2) if Xm = Xm-i 
2 if Xm 7^ 3^m—1) 


— ^rri; 

— H“ 1* 

:= (/i,...,/m-i)* We prove that / is a complete quasi-parallel n-universal trans- 

formation by giving a program simulating the sequence repeated £ times for any 

£> 1 . 

Step 1. Do Note that this makes C equal to C-l-l mod {Q'^ + 2) or 2, and turns the switch 

off. Xm—l — Xm‘ 

Step 2. Do This turns the switch on: Xm 7^ Xm-i- 

Step 3. Do This makes C = 2, and turns the switch on Xm-i = Xm- Note that the contents 

of the first n registers are now contained in (ir)pr[ 3 „]ypn]- 

Step 4. Do . At each iteration, this increases the counter C = (C -|- 1) mod (Q"' -|- 2), 

and so it computes the whole sequence p^‘^\ ... Note that the original value of (x)pr[„] 

is back in (x)pr[„]. 

Step 5. Do (F[(™-i)])(^“b(Q"+2) in order to compute £ — 1 iterations of p^^\ ■ ■ ■ 

Now, as in the proof of Theorem [TJ in order to simulate an arbitrary sequence p^*^^... ,pb^l g 

Tran(A”'), we use the program above to simulate £ times the full sequence p^‘^\ ■ ■ ■ □ 


{x)fm-i 

{x)fm 

Let 
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